CALDER 




National Center for Analysis of Longitudinal Data in Education Research



TRACKING EVERY STUDENT'S LEARNING EVERY YEAR 



Urban Institute 



A program of research by the Urban Institute with Duke University, Stanford University, University of Florida,
University of Missouri-Columbia, University of Texas at Dallas, and University of Washington 



The Narrowing Gap in 
New York City Teacher 
Qualifications and 
Its Implications for 
Student Achievement in 
High-Poverty Schools 

Donald Boyd, Hamilton 
Lankford, Susanna Loeb, 
Jonah Rockoff, 

AND James Wyckoff 



WORKING PAPER 10 • SEPTEMBER 2007






The Narrowing Gap in New York City Teacher Qualifications and its 
Implications for Student Achievement in High-Poverty Schools 



August 2007 



Donald Boyd 
University at Albany 

Susanna Loeb 
Stanford University 

James Wyckoff 
University at Albany 



Hamilton Lankford 
University at Albany 

Jonah Rockoff 
Columbia University 



We are grateful to the New York City Department of Education and the New York State 
Education Department for the data employed in this paper. Vicki Bernstein, Katherine Boisture, 
Joe Frey, Robert Gordon, Brian Jacob, Sarah Shafer, Nancy Taylor-Baumes and Nancy Willie-Schiff
provided helpful comments on an earlier draft. We appreciate financial support from the
Carnegie Corporation of New York, the National Science Foundation, the Spencer Foundation
and the National Center for the Analysis of Longitudinal Data in Education Research (CALDER).
CALDER is supported by IES Grant R305A060018 to the Urban Institute. The views expressed
in the paper are solely those of the authors and may not reflect those of the funders. Any errors 
are attributable to the authors. 




I. Introduction 



What is the distribution of educational resources across schools and what effect do 
disparities in resources have on the achievement of poor and minority students? This question 
dates to the Coleman Report (1966), but continues to be hotly debated, involving the courts as 
well as federal, state and local governments. Arguably the most important educational resource is 
teachers. Disparities in teacher qualifications figure prominently in most educational policy 
discussions and are a central feature of the No Child Left Behind Act of 2001 (NCLB) which 
requires a "highly qualified teacher" in every classroom in a core academic subject. Many states 
and large districts also have policies in place to attract teachers to difficult-to-staff schools (Loeb
and Miller, 2006). 

The recent interest in teacher labor markets stems in part from recognition of the importance 
of teachers and from the recognition of substantial differences across schools in the qualifications 
of teachers. A consistent finding in the research literature is that teachers are important for 
student learning and that there is great variation in effectiveness across teachers (Sanders and 
Rivers, 1996; Aaronson, Barrow and Sander, 2003; Rockoff, 2004; Rivkin, Hanushek and Kain, 
2005; Kane, Rockoff and Staiger, 2006). Thus, understanding what makes an effective teacher as 
well as how teachers sort by their effectiveness across schools is central to understanding and 
addressing student achievement gaps. 

Prior studies have found substantial sorting of teachers across schools with the schools with 
the highest proportions of poor, non-white, and low-scoring students having the least qualified 
teachers as measured by certification, exam performance, and inexperience (Lankford, Loeb and 
Wyckoff, 2002). Yet, there have been substantial changes in the educational policy landscape 
over the past five years. New laws, including NCLB, have changed requirements for teachers. 
Assessment-based accountability policies at the state-level have created standards and increased 
oversight of schools, especially those with low-achieving students. New routes into teaching, 
many with fewer requirements before teaching, have changed the cost for individuals to enter the 
teaching profession. These changes have affected teacher labor markets profoundly. 

In this paper we examine these changes, asking how the distribution of teachers has changed 
in recent years and what the implications of these changes are for students. We examine three 
questions: 

■ How has the distribution of teaching qualifications between schools with concentrations 
of poor students and those with more affluent students changed over the last five years? 







■ What effects are the changes in observed teacher qualifications likely to have on student
achievement? 

■ And, what implications do these findings have for improving policies and programs 
aimed at recruiting highly effective teachers? 

This study uses data on New York City teachers, students, and schools to address these questions. 
While the findings could be specific to New York City, they may mirror changes in other large
urban districts, many of which have seen similar policy changes over the past decade. 

We find that measurable characteristics of teachers are more equal across schools in 2005 
than they were in 2000. Schools with large proportions of poor students and students of color, on 
average, have teachers whose observable qualifications are much stronger than they were five 
years ago. Nonetheless, a meaningful number of schools with large proportions of poor students 
did not demonstrate such improvement. We find that changes in these observed qualifications of 
teachers account for a modest improvement in the average achievement of students in the poorest 
schools. More importantly, our results suggest that recruiting teachers with stronger observed
qualifications, e.g., math SAT scores or certification status, could substantially improve student 
achievement. 

II. Background 

A growing literature finds that teachers “sort” very unequally across schools, with the least- 
experienced teachers and those with the poorest academic records often in schools with the 
highest concentrations of low-income, low-performing and minority students (See, for example, 
Betts, Reuben and Danenberg, 2000; Lankford, Loeb, and Wyckoff, 2002; Bonesrønning, Falch,
and Strøm, 2005; Clotfelter, Ladd and Vigdor, 2006; and Peske and Haycock, 2006). Across
several different states and at least one other country, low-performing, poor, and minority 
students systematically are taught by teachers with the weakest credentials, such as certification 
status and exam scores, SAT scores, ranking of undergraduate college, and, importantly, teaching 
experience. As but one example, Lankford, Loeb and Wyckoff (2002) find systematic sorting of 
New York State’s elementary school teachers in 2000. Non-white students were four times more 
likely than white students to have a teacher who was not certified in any of the courses he or she 
taught and 50 percent more likely to have a teacher with no prior experience. The sorting of 
teacher qualifications within districts can also be substantial. In New York City elementary 
schools in 2000, non-white students were 40 percent more likely to have a teacher who was not 
certified in any of the courses she taught and 40 percent more likely to have a teacher with no 
prior experience. This sorting resulted from teachers’ choices about whether and where to start a 







teaching career and whether and where to remain in teaching - choices made within the constrained
labor market governed by administrator choices, teacher contracts, and state and district
regulations (for a more complete discussion of teacher sorting, see Boyd, Lankford and Wyckoff,
2007).

Teachers significantly influence student achievement (Sanders and Rivers, 1996; Aaronson, 
Barrow and Sander, 2003; Rockoff, 2004; Rivkin, Hanushek and Kain, 2005; Kane, Rockoff and 
Staiger, 2006). Sanders and Rivers (1996) estimate that differences in teacher quality can provide 
up to a 50 percentile improvement in student achievement and that these improvements are 
additive and cumulative over subsequent teachers. Kane, Rockoff and Staiger (2006) estimate 
that the difference in effectiveness between the top and bottom quartile of teachers results in a .33 
standard deviation difference in student gains over the course of a school year. 

While there is consensus that more effective teachers produce dramatically greater student 
achievement than less effective teachers, there is much less consensus on the attributes of 
teachers responsible for these differences. Much, though not all, of the recent research examining 
teacher effectiveness concludes that some teachers’ attributes, such as higher test scores and 
greater teaching experience, will produce students with higher achievement. However, the effects 
of most teacher attributes appear small in comparison to the substantial variation across students 
in how much they learn in a year, as measured by test score gains. Studies of teachers’ value- 
added to student achievement use state or district administrative data and thus are usually limited 
to assessing the effects of teacher characteristics collected by these entities. Teacher experience 
and certification are among the most studied. 

Students of first year teachers learn less, on average, than students of more experienced 
teachers. This difference could be driven either by the improvement of teachers or by differential 
attrition of the worst teachers. If worse teachers are more likely to leave after their first year, as 
found by Boyd, et al. (2007) in New York City, then at least some of the better average 
performance of more experienced teachers would be due to compositional change instead of 
improved teaching. Removing the effects of compositional change, studies find that first-year 
teachers produce student achievement gains that are from .03 to .20 standard deviations less than 
otherwise similar teachers with ten to fifteen years of experience (Rockoff, 2004; Rivkin, 
Hanushek and Kain, 2005; Kane, Rockoff and Staiger, 2006). Most of these gains from 
experience occur within the first four years of teaching. 

Many studies examine the effect of teacher certification on student achievement. These 
studies differ, sometimes substantially, in their findings. Most likely do not account adequately 
for the systematic differences in the schools in which the average certified teacher and the







average uncertified teacher work. However, three recent studies with strong research designs and 
good data are able to address how teacher certification affects student achievement (Goldhaber,
2006; Clotfelter, Ladd and Vigdor, 2006; and Boyd, Grossman, Lankford, Loeb and Wyckoff,
2006). In both North Carolina and New York City, these studies find that the students of teachers 
with certification outperform those whose teachers are uncertified. The achievement effect of 
certification is about 2 to 4 percent of a standard deviation in math, which is about half as large as 
the gain resulting from the first year of teacher experience. The effect in reading is about half this 
size. These findings should not be confused with the effect of a system without certification. For 
example, many of the uncertified teachers are uncertified because they were unable to pass their 
certification exam, even though they completed much of the same course work and field 
experiences as certified teachers. It is very difficult to predict how the composition of teachers 
would change in the absence of certification requirements. 

The studies described above address the effects of specific teacher attributes. The effects in 
most cases appear to be modest. However, the variation in teacher attributes across schools is not 
independent. That is, schools with the highest proportion of first-year teachers also tend to have
the highest proportion of uncertified teachers and the lowest prior academic performance of 
teachers. Teacher attributes vary together, and thus they should be taken together when 
considering the true difference in the effectiveness of teachers serving different student 
populations. In this paper, we assess the total effects of the differences in measurable 
characteristics of teachers across schools. We trace changes in the distribution of teachers across 
schools in New York City from 2000 to 2005 and estimate the effects that these changes are 
likely to have had on students in the traditionally most difficult-to-staff schools. 



III. Data 

The analysis is divided into two sections. The first section examines how the sorting of 
teacher qualifications across schools, categorized by poverty status and the racial-ethnic 
composition of students, has changed between 2000 and 2005. We then estimate how the 
changing composition of teacher qualifications affected student achievement gains. The analyses 
draw on a rich database constructed from administrative data from the New York City 
Department of Education, the New York State Education Department, alternatively certified 
teacher programs, and the College Board. 

New York State gives statewide student exams in mathematics and English language arts in 
4th and 8th grade. In addition, the New York City Department of Education tests 5th, 6th and 7th







graders in these subjects. All the exams are aligned to the New York State learning standards and 
each set of tests is scaled to reflect item difficulty and is equated across grades and over time.¹
Tests are given to all registered students with limited accommodations and exclusions. Thus, for 
nearly all students the tests provide a consistent assessment of achievement for a student from 
grade three through grade eight. 

To analyze the relationship between teacher qualifications and student achievement, we 
create a student database with student exam scores, lagged scores and characteristics of students 
and their peers linked to their schools and to teachers and characteristics of those teachers. The 
student data, provided by the New York City Department of Education (NYCDOE), consists of a 
demographic data file and an exam data file for each year from 1998-99 through 2004-05. The 
demographic files include measures of gender, ethnicity, language spoken at home, free-lunch 
status, special-education status, number of absences, and number of suspensions for each student 
who was active in grades three through eight that year - approximately 450,000 to 500,000 
students each year. 

The exam files include, among other things, the year in which an exam was given, the grade 
level of the exam, and each student’s scaled score on the exam. For most years, the file contains
scores for approximately 65,000 to 80,000 students in each grade. The only significant exception
is that the files contain no scores for 7th grade English language arts in 2002 because the New
York City Department of Education is not confident that exam scores for that year and grade were
measured in a manner that was comparable to the 7th grade English language arts exam in other
years. 

Using these data, we construct a student-level database where exam scores are normalized for 
each subject, grade and year to have a zero mean and unit standard deviation to accommodate any 
year-to-year or grade-to-grade anomalies in the exam scores. For this purpose, we consider a
student to have value-added information in cases in which he/she has a score in a given subject
(ELA or math) for the current year and a score for the same subject in the immediately preceding
year for the immediately preceding grade. We did not include cases in which a student took a test 
for the same grade two years in a row, or where a student skipped a grade. 
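The normalization and the value-added inclusion rule described above can be illustrated with a minimal sketch. It is not the authors' code: the record layout (`student_id`, `subject`, `grade`, `year`, `scaled_score`) and both function names are hypothetical, and the use of the population standard deviation is an assumption, since the text specifies only a zero mean and unit standard deviation within each subject, grade and year.

```python
from collections import defaultdict
from statistics import fmean, pstdev


def standardize_scores(records):
    """Return copies of the records with a z_score computed within each
    (subject, grade, year) cell, so each cell has mean 0 and SD 1."""
    cells = defaultdict(list)
    for r in records:
        cells[(r["subject"], r["grade"], r["year"])].append(r["scaled_score"])
    # Assumption: population SD (pstdev); the paper does not say which SD is used.
    stats = {key: (fmean(vals), pstdev(vals)) for key, vals in cells.items()}

    out = []
    for r in records:
        mean, sd = stats[(r["subject"], r["grade"], r["year"])]
        # Guard against a degenerate cell with no variation in scores.
        z = (r["scaled_score"] - mean) / sd if sd > 0 else 0.0
        out.append({**r, "z_score": z})
    return out


def value_added_pairs(records):
    """Keep a score only if the same student has a score in the same subject
    for the immediately preceding year AND the immediately preceding grade.
    Students who repeat or skip a grade have no such prior record and drop out."""
    by_key = {
        (r["student_id"], r["subject"], r["grade"], r["year"]): r for r in records
    }
    pairs = []
    for r in records:
        prior = by_key.get(
            (r["student_id"], r["subject"], r["grade"] - 1, r["year"] - 1)
        )
        if prior is not None:
            pairs.append({**r, "prior_z_score": prior["z_score"]})
    return pairs
```

Looking up the prior record by grade minus one and year minus one enforces both exclusions at once, so no separate check for repeated or skipped grades is needed. The sketch assumes at most one score per student, subject, grade and year.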

To enrich our data on teachers, we match New York City teachers to data from New York 
State Education Department (NYSED) databases and College Board databases, using a crosswalk 
file provided by NYCDOE that links their teacher file reference numbers to unique identifiers 

¹ The mathematics exams in all grades are developed by CTB-McGraw Hill. New York State employs
CTB-McGraw Hill for its 4th and 8th grade ELA exams. In 2003 New York City switched from CTB to
Harcourt Brace for its 3rd and 5th-7th grade exams. At that time there was an equating study done to
accommodate the switch in exams.







compatible with both databases. We drew variables for NYC teachers from these data files as 
follows: 



■ Teacher Experience: For teacher experience, we used transaction data from the 
NYCDOE Division of Human Resources payroll system to calculate experience in 
teaching positions in the New York City public school system. 

■ Teacher Demographics: We drew gender, ethnicity, and age from a combined analysis 
of all available data files, to choose most-common values for individuals. 

■ Undergraduate: We identified the institutions from which individual teachers earned 
their undergraduate degrees using the NYS Teacher Certification Database (TCERT) 
and combined it with the Barron's ranking of college selectivity to construct variables 
measuring the selectivity of the college from which each teacher graduated. 

■ Certification: We identified current certification areas from the NYS Teacher 
Certification Database (TCERT). 

■ SAT scores: We obtained SAT scores for all individuals taking the SAT in New York 
State through 2002 from the College Board. 

■ Test performance: We drew information regarding the teacher certification exam scores 
of individual teachers and whether they passed on their first attempts from the NYS 
Teacher Certification Exam History File (EHF). 

■ Pathway: Initial pathway into teaching comes from an analysis of teacher certification 
applications plus separate data files for individuals who participated in Teach For 
America (TFA), the Teaching Fellows Program, and the New York City Teaching
Opportunity Program, obtained directly from program officials. 

■ College Recommended: We obtained indicators for whether an individual had 
completed a college-recommended teacher preparation program and if so, the level of 
degree obtained (bachelor’s or master’s), from NYSED’s program completers data files. 
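The linking and "most-common value" steps in the list above can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: the field names `nyc_ref` and `state_id`, the dictionary-based crosswalk, and both function names are hypothetical.

```python
from collections import Counter


def most_common_value(values):
    """Return the most frequent non-missing value across data files,
    or None if every value is missing. Ties go to the value seen first."""
    observed = [v for v in values if v is not None]
    return Counter(observed).most_common(1)[0][0] if observed else None


def link_teachers(nyc_teachers, crosswalk, state_records):
    """Attach state-level fields to NYC teacher rows through a crosswalk
    that maps NYC reference numbers to state identifiers."""
    state_by_id = {rec["state_id"]: rec for rec in state_records}
    linked = []
    for teacher in nyc_teachers:
        state_id = crosswalk.get(teacher["nyc_ref"])
        state_rec = state_by_id.get(state_id, {})
        # Unmatched teachers are kept without state fields (a left join).
        extra = {k: v for k, v in state_rec.items() if k != "state_id"}
        linked.append({**teacher, **extra})
    return linked
```

Keeping unmatched teachers is a design choice made for the sketch; the text does not say how unmatched records were handled.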

Finally, we match teachers and students to their schools, and incorporate data on those 
schools from the New York City Department of Education Annual School Report database, 
including: 

■ School-average performance on state and city standardized exams 

■ Poverty status as measured by the percentage of students eligible for Free Lunch

■ Racial and ethnic breakdown of students 

■ Expenditures per pupil 

The analysis of teacher sorting links teachers to schools and places schools into poverty 
groups based on the percentage of children eligible for free lunch in the first year a school 
appears in our database. We use a fixed school poverty group for each school so that it will not be 
influenced by year-to-year changes in reported free lunch percentages that sometimes appear 
spurious.² In defining groups, we weight each school by the number of teachers in our data, so



² In analysis that is not presented, we allowed the composition of quartiles to vary over time as quartile
boundaries and school poverty values change. These results are available from the authors. The results 
presented are not sensitive to this distinction. 







that a school with many teachers will count more than a school with few teachers. The poverty 
groups are defined separately for elementary schools, middle schools, and high schools. In 
addition, for most of our analysis we only include schools present in both 2000 and 2005, so that
the analysis will not be affected by changes in classifications of schools.³ We take a similar
approach for categorizing schools based on race and ethnicity. 
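One way to form teacher-weighted poverty groups is sketched below. The paper does not give the exact cutoff rule, so the midpoint convention used here is an assumption, as are the field names `school_id`, `pct_free_lunch` (taken from the first year a school appears) and `n_teachers`, and the function name itself.

```python
def assign_poverty_groups(schools, n_groups=4):
    """Assign each school to a poverty group (1 = lowest poverty), with
    cutoffs placed so that each group holds a roughly equal share of
    teachers rather than an equal number of schools."""
    ordered = sorted(schools, key=lambda s: s["pct_free_lunch"])
    total_teachers = sum(s["n_teachers"] for s in ordered)

    groups = {}
    cumulative = 0
    for s in ordered:
        # Assumption: a school is placed by the midpoint of its span in the
        # cumulative teacher distribution.
        midpoint = (cumulative + s["n_teachers"] / 2) / total_teachers
        groups[s["school_id"]] = min(int(midpoint * n_groups) + 1, n_groups)
        cumulative += s["n_teachers"]
    return groups
```

Calling the function separately for elementary, middle and high schools would mirror the separate definitions described in the text, and `n_groups=10` would give deciles. Because the input uses each school's first-year free-lunch percentage, the resulting group is fixed over time.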

IV. The Changing Distribution of Teacher Qualifications

The analysis below uses several indicators of teacher qualifications that researchers have 
previously employed to describe the teaching workforce. These measures include teaching 
experience, performance on state teacher certification exams, certification status and area, 
competitiveness of a teacher’s undergraduate institution, pathway into teaching, and SAT scores. 
As discussed later, each of these measures appears likely to bear some relationship to student 
achievement, although the relationships are not always consistently strong or large in magnitude. 

We analyze the distribution of teacher qualifications by the poverty status of students in the 
schools where these teachers work. There is substantial variation across the poverty groups in the 
percentage of students eligible for free lunch, as shown in the last row of Table 1. However, in 
New York City even schools in the decile or quartile with the lowest percentage of free lunch- 
eligible students contain some students who are poor using this proxy. Thus, when we employ 
the terms affluent or rich in describing schools, this is a relative concept. By these measures, the 
distribution of teachers in 2000 was unequal. For example, Figure 1 shows that high-poverty
schools were far more likely to have novice teachers: 25 percent of teachers in schools in the 
highest-poverty group (top 10 percent) were in their first two years of teaching, compared with 15 
percent of teachers in the lowest-poverty group (bottom 10 percent). Table 1 shows that these 
patterns held across other available measures of teacher qualifications: teachers in the highest- 
poverty schools failed the Liberal Arts and Sciences Test (LAST), a state teacher certification 
exam that measures general knowledge, nearly three times as frequently as did teachers in low- 
poverty schools; they were much more likely to have graduated from the least-competitive 
colleges; and they had much lower scores on SAT exams than did teachers in low-poverty
schools. 



³ We also examine teacher sorting for all schools and with the exception of a somewhat larger narrowing
of the gap in the percentage of novice teachers across poverty quartiles, the results are insensitive to this 
change. These results are available from the authors. 






The Narrowing Gap 

Between 2000 and 2005 there was a remarkable narrowing in the gap in teacher 
qualifications between high-poverty schools and low-poverty schools. In particular, the high- 
poverty schools improved considerably while the low-poverty schools either did not improve or 
did so only slightly. Figure 2 illustrates the narrowing of the gap in the failure rate on the LAST 
exam. In 2000, 35 percent of teachers in the highest-poverty quartile of schools failed the LAST 
the first time they took the exam, compared with 15 percent in the lowest-poverty quartile, for a 
gap of 20 percentage points. By 2005, less than 25 percent of teachers in the highest-poverty 
quartile had failed the LAST on the first attempt, while the lowest-poverty quartile actually 
remained constant, so the gap narrowed by ten percentage points, or half its level five years 
earlier. Figure 3 shows a similar trend for teacher experience. In 2000, just over 25 percent of 
teachers in the highest-poverty quartile of schools had less than three years of experience, 
compared with slightly more than 17 percent in the lowest-poverty quartile of schools, for a gap 
of eight percentage points. By 2005, 22 percent of teachers in the highest-poverty schools were 
novices, narrowing the gap to about six percentage points.⁴
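The gap arithmetic for the LAST failure rates in this paragraph can be reproduced with a short sketch. It uses only the rounded percentages quoted in the text. The 2005 rate for the highest-poverty quartile is reported only as "less than 25 percent," so the value 25 is an assumed upper bound, and the function name is hypothetical.

```python
def percentage_point_gap(high_poverty_pct, low_poverty_pct):
    """Gap in a rate between high- and low-poverty schools, in percentage points."""
    return high_poverty_pct - low_poverty_pct


# First-attempt LAST failure rates quoted in the text (percent of teachers).
gap_2000 = percentage_point_gap(35, 15)

# Assumption: "less than 25 percent" is treated as 25; the lowest-poverty
# quartile is reported as unchanged at 15.
gap_2005 = percentage_point_gap(25, 15)

# Share of the 2000 gap that had closed by 2005.
share_closed = (gap_2000 - gap_2005) / gap_2000

print(gap_2000, gap_2005, share_closed)
```

Under these rounded inputs the gap falls from 20 to 10 percentage points, which matches the statement that the gap narrowed to half its earlier level.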

Table 2 shows that the same basic pattern held with other teacher qualifications, including 
SAT verbal and math scores, and the percentage who attended least-competitive colleges. In 
general, the gap between the lowest and highest poverty schools narrowed as a result of 
substantial improvements in the highest poverty schools. Table 2 also shows expenditures per 
pupil and average teacher salaries (which are available for 2005 but not for 2000). Expenditures 
per pupil were higher in high-poverty schools than in low-poverty schools in both years, and the 
difference actually increased between 2000 and 2005. Although total spending was higher in 
high-poverty schools, average teacher salaries are higher in the low-poverty schools. The 
differences in teacher salaries reflect the remaining difference in teacher experience between low 
and high poverty schools. 

There are similar trends in teacher qualifications across schools by grade level; however,
elementary schools experienced the greatest narrowing in the teacher qualifications gap. For 
example, as shown in Appendix Table 3a, the novice experience gap between high-poverty and 
low-poverty elementary schools in 2000 was 12 percentage points. By 2005 that had diminished 
to 5.6 percentage points. Similarly, the gaps in passing the LAST exam and SAT scores were 

⁴ Very similar changes in teacher qualifications occur when schools are categorized by the proportion of the
school’s students who are black or Hispanic; the proportion of students in the school who failed to reach
proficiency on the state fourth grade math exam; and the proportion of students in the school who failed to
reach proficiency on the state fourth grade reading exam. In each case, the initial gaps and the closing of
the gaps in teacher qualifications from 2000 to 2005 follow the same pattern as shown for the schools
arrayed by poverty. These results are shown in Appendix tables 1 through 3.







reduced by 50 percent. Although middle schools also had a novice experience gap of over 11
percentage points in 2000, there was no meaningful reduction by 2005 (Appendix Table 3b). A 
much smaller percentage of middle school teachers failed the LAST exam initially than was the 
case for elementary school teachers and the middle school failure rate declined only modestly 
between 2000 and 2005. Finally, high schools experienced some meaningful improvement 
between 2000 and 2005 (Appendix Table 3c). On most measures the narrowing of the gap in 
qualifications fell between those of elementary schools and middle schools. 

Not all poor schools experienced an improvement in teacher qualifications over this period. 
Figure 4 shows the distribution of change between 2000 and 2005 in the school average 
proportion of teachers who failed the LAST exam on their first attempt. Most schools in the 
poorest decile experienced a reduction in proportion failing, as indicated by the large portion of 
the distribution with negative changes. However, 20 percent of the poorest schools experienced
an increase in the proportion failing (those to the right of the zero axis), although in many cases the
increase was small. Similar results hold for the other measures of teacher qualifications.
Nonetheless, a substantial proportion of the high-poverty schools did not share in the improved 
qualifications of teachers. 

Explaining the Change 

To further understand the recent change in teacher sorting it is worth asking to what extent the 
change is driven by new hires as opposed to the behaviors of more experienced teachers. Little of 
the change in teacher qualifications among poverty quartiles between 2000 and 2005 is 
attributable to the transfer and quit behavior of teachers. Figure 5 shows how the average
first-time failure rate on the LAST exam for those teaching in 2000 changes over time as that group
moves across schools or leaves teaching in New York City. In 2000, the difference between the 
lowest and highest poverty quartiles of the first-time failure rates on the LAST exam is 23 
percentage points. The gap remains unchanged in 2005. Similar results hold for other measures 
of teacher qualifications. It is evident that the transfer and quit behavior of teachers had little to 
do with the reduced gap in teacher qualifications. 

As illustrated by Figure 6, the dramatic reductions in the teacher-qualifications gap have 
been driven primarily by changes in the qualifications of newly hired teachers and the ways in 
which they vary with the poverty status of schools. Figure 6 shows that the average failure rate on 
the LAST exam of newly hired teachers converged between 2000 and 2003, so that from 2003 
forward the failure rate was about the same across poverty categories of schools. A similar 
convergence occurred for SAT scores, but not for the competitiveness of colleges attended by 
teachers. 







The pattern of improving and converging qualifications of new teachers is driven
primarily by three policy changes: (1) In 1998 the New York State Board of Regents
recommended abolishing temporary licenses for uncertified teachers effective September 1, 2003,
and, except for limited waivers in New York City for 2004 and 2005, this was accomplished.
(2) In 2000 the Regents created alternative certification routes that allow school districts to
hire teachers who are participating in approved alternative certification programs, as long as
they are able to pass required teacher certification exams. (3) In
collaboration with The New Teacher Project, the New York City Department of Education
developed the Teaching Fellows program and in 2000 selected its first cohort of Fellows. Fellows
grew from about 1 percent of newly hired teachers in 2000 to 33 percent of all new teachers in 
2005, as Figure 7 shows. Over the same period, temporarily licensed teachers fell from 53 percent 
of new hires to 3 percent. 

The shift in the entry pathway of teachers has had a large impact on the distribution of 
teacher qualifications for two reasons. First, Teaching Fellows and TFA teachers on average have 
test scores and prior academic experiences that are stronger than those of other teachers, and 
much stronger than those of temporarily licensed teachers. For example, only 5 percent of newly 
hired Teaching Fellows/TFA teachers in 2003 failed the LAST exam on their first attempt, while
16.2 percent of newly hired traditional teachers failed the LAST exam, and fully 32.5 percent of
temporarily licensed teachers failed the LAST exam. Second, newly hired Teaching Fellows and
TFA teachers are placed disproportionately in high-poverty schools, as were their temporarily 
licensed predecessors. Between 2000 and 2005, 44 percent of newly hired Teaching Fellows and 
TFA teachers were placed in schools in the highest-poverty quartile; and, by 2005, 40 percent of 
all new hires in the highest poverty quartile were Teaching Fellows or TFA corps members. In 
2000, before Fellows and TFA teachers were significant in numbers, 63 percent of newly hired 
teachers in the highest poverty quartile were temporarily licensed teachers. The hiring of Fellows 
and TFA teachers into high poverty schools, instead of temporarily licensed teachers, has been 
responsible for much of the narrowing of the gap in teacher qualifications between high-poverty 
and low-poverty schools.⁵



⁵ One additional factor which may have also helped contribute to these changes is a considerable increase
in the salaries of teachers in New York City, particularly for new teachers. The starting salary for a teacher 
with no experience and a bachelor’s degree rose from $33,186 in 2000 to $39,000 in 2003. This salary 
schedule applies to teachers in all schools, regardless of poverty, and thus it is difficult to establish any 
direct link between salaries and the sorting of teachers. However, it is quite plausible that higher salaries 
for new teachers aided the recruitment and retention of Teaching Fellows and other highly qualified 
individuals choosing to teach in high poverty schools. 







V. The Relationship between Teacher Qualifications and Student Achievement

Over the same period in which the gap in teacher qualifications narrowed, the gap in the
proportion of students failing to meet proficiency standards also narrowed. In the 2000 school
year, 30 percent of students in the lowest-poverty group failed to meet state proficiency standards
on the grade 4 ELA exam,⁶ while 74 percent of students in the highest-poverty schools failed to
meet the state standard, for a gap of 44 percentage points. Between 2000 and 2005 failure rates
declined in all poverty groups, as shown in Table 3, but they declined the most in the highest-
poverty schools, so that the gap between low- and high-poverty groups narrowed to 32 points.

Table 3 shows that this narrowing of the gap in the percentage of students reaching proficiency 
between high- and low-poverty schools occurred across all four major state exams (ELA for 
grades 4 and 8, and math for grades 4 and 8), although middle school tests showed only a slight 
closing of the gap. We have also examined other measures of the achievement gap, including 
average test scores by school and the percentage of a school’s students scoring at Level 4 (the 
highest level of performance). By all measures except the Level 4 percentage for 8th grade ELA, 
the achievement gap between high-poverty and low-poverty schools narrowed between 2000 and 
2005. In general, achievement in high-poverty schools has improved and come closer to that of 
low-poverty schools, although in some cases the effects are not large. 

However, while the narrowing of student achievement across poverty groupings of 
schools occurred concurrently with the narrowing of the teacher-qualifications gap across these 
groupings, the causal relationship between the two trends is not clear. The change in teachers 
may have caused the change in student outcomes; the change in student outcomes may have 
caused the change in teachers; a third factor may have led to both changes; or, alternatively, they 
may have separate though simultaneous causes. Whether teacher qualifications played a role in 
this narrowing is an open question. While we cannot determine the complete causal mechanism, 
we can predict how much a change in measurable characteristics would, on average, affect 
student outcomes. The prediction may under- or overestimate the effects of the changes in 
teacher sorting on student achievement, depending on how unmeasured characteristics of teachers 
changed during this same time period. If teacher sorting was reduced on positive unmeasured as well 
as measured characteristics, then the estimates will underestimate the teacher effects; if teacher 



® New York’s student achievement data for statewide standardized exams place each student’s test results 
in one of four performance levels, with levels 1 and 2 designated as not meeting proficiency. Level 1 for 4th 
grade ELA is described by the New York State Education Department as, “These students have serious 
academic deficiencies. They show no evidence of any proficiency in one or more of the elementary 
standards and incomplete proficiency in all three standards.” 







sorting increased on positive unmeasured characteristics, then we will overestimate the total 
teacher effect. 

Estimating the Effects of Measured Teacher Characteristics 

It is not easy to estimate how the achievement gains of students are affected by the 
qualifications of their teachers because teachers are not randomly sorted into classrooms. For 
example, if teachers in schools in which students perform best in math are more likely to be 
certified in math, one might be tempted to conclude that being certified to teach math contributes 
to higher student achievement. The causal relationship, however, may operate in the other 
direction; that is, more qualified teachers may be in schools where students perform well in math 
because they prefer to teach good students and because employers want to staff their courses with 
in-field certified teachers. Analysts need to be careful not to attribute the test-score gains 
associated with sorting to the attributes of teachers. Unfortunately, there is not a specific agreed- 
upon methodology for answering this question in a non-experimental framework. Because of 
this, we choose to run a number of different specifications in order to test the robustness of the 
estimated effects. 

Equation 1 summarizes our base model for estimating teacher attribute effects. 

$$A_{isgty} - A_{is'(g-1)t'(y-1)} = \gamma_0 + \gamma_1 S_{iy} + \gamma_3 C_{ty} + \gamma_4 T_{ty} + \pi_i + \pi_g + \pi_y + \varepsilon_{isgty} \qquad (1)$$

Here the standardized achievement gain score of student i in school s in grade g with teacher t in 
year y is a linear function of time-varying characteristics of the student S, characteristics of the 
other students in the same grade with the same teacher in that year C, and the teacher’s 
qualifications T. The model also includes student, grade and time fixed effects and a random 
error term. The time-varying student characteristic is whether the student changed schools 
between years. Class variables include the proportion of students who are black or Latino, the 
proportion who receive free or reduced price school lunch, the class size, the average number of 
student absences in the prior year, the average number of student suspensions in the prior year, 
the average achievement scores of students in the prior year, and the standard deviation of student 
test scores in the prior year. Teaching experience is measured by separate dummy variables for 
each year of teaching experience up to a category of 21 and more years. Other teacher 
qualifications include whether the teacher passed the general knowledge portion of the 
certification exam on the first attempt, certification test scores, whether and in what area the 
teacher was certified, the Barron’s ranking of the teacher’s undergraduate college, math and verbal 







SAT scores^, the initial path through which the teacher entered teaching, e.g., a traditional college 
recommended program or the New York City Teaching Fellows program, and an interaction term 
of the teacher’s certification exam score and the portion of the class eligible for free lunch. The 
standard errors are clustered at the teacher level to account for multiple student observations per 
teacher. * 

Student achievement gains are measured as the difference between the student’s test 
score in a given year and his or her test score in the prior year. Student achievement gains are 
computed after normalizing test scores to have zero mean and unit standard deviation for each 
year and grade. Based on the differential pattern of teacher sorting between elementary and 
middle schools described above and earlier research that finds differences in the determinants of 
student achievement across grade levels (Boyd et al. 2006), we estimate four models: separate 
models for math and ELA, and separate models for students in 4th or 5th grades and those in 6th 
through 8th. We present only the math results; the effect of observed qualifications on student 
achievement in ELA in both grade groupings is very small. 
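
The construction of the gain score described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' code: the record layout, the function names and the toy scores are hypothetical, and the population standard deviation is used because the text does not say which estimator was applied.

```python
from collections import defaultdict
from statistics import mean, pstdev


def standardize_by_year_grade(records):
    """Normalize scores to zero mean and unit SD within each (year, grade) cell.

    `records` is a list of dicts with hypothetical keys:
    student, year, grade, score. Cells with zero variance are not handled.
    """
    cells = defaultdict(list)
    for r in records:
        cells[(r["year"], r["grade"])].append(r["score"])
    stats = {key: (mean(vals), pstdev(vals)) for key, vals in cells.items()}

    z = {}
    for r in records:
        mu, sd = stats[(r["year"], r["grade"])]
        z[(r["student"], r["year"])] = (r["score"] - mu) / sd
    return z


def gain_scores(records):
    """Gain = standardized score this year minus standardized score last year."""
    z = standardize_by_year_grade(records)
    return {
        (student, year): z[(student, year)] - z[(student, year - 1)]
        for (student, year) in z
        if (student, year - 1) in z
    }


# Toy data (hypothetical): three students observed in grade 4 and then grade 5.
records = [
    {"student": "a", "year": 2004, "grade": 4, "score": 640},
    {"student": "b", "year": 2004, "grade": 4, "score": 660},
    {"student": "c", "year": 2004, "grade": 4, "score": 680},
    {"student": "a", "year": 2005, "grade": 5, "score": 670},
    {"student": "b", "year": 2005, "grade": 5, "score": 670},
    {"student": "c", "year": 2005, "grade": 5, "score": 700},
]
gains = gain_scores(records)
for key in sorted(gains):
    print(key, round(gains[key], 3))
```

Because scores are standardized within each year and grade, a gain measures movement relative to the citywide distribution, so gains average to zero across any group of students observed in both years.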

Many of the measures of teachers’ qualifications are highly correlated with each other in our 
sample. The LAST certification exam score and the verbal SAT are correlated at 0.68; attending 
a most competitive undergraduate college is correlated with the verbal SAT at 0.35; and 
certification to teach math and entering teaching through the New York City Teaching Fellows 
program have a correlation of 0.30. As a result, including them all in one large regression may 
understate the importance of individual qualifications in affecting student achievement. For 



^ We impute values for SAT scores and the LAST certification exam for all teachers with missing values. 
We observe SAT’s for every person who took the SAT in New York from 1980 until 2000. Thus we may 
be missing SAT scores for three groups: those who took the SAT prior to 1980 and thus are likely to be 
more experienced teachers; those who took the SAT in another state, and those who never took the SAT. 
We do not observe SAT scores for about 53 percent of the teachers in our sample. Two-thirds of the 
teachers for whom we are missing SAT scores were born prior to 1963 and thus were older than 17 in 
1980, when our SAT data begin. 

Finally, New York State switched teacher certification exams from the Educational Testing Service (ETS) 
general knowledge exam to an exam designed for New York State by National Evaluation Systems (NES) 
in 1995. Because our sample includes teachers who took the ETS exam, we create a dummy variable that 
indicates if a teacher passed either exam the first time they took it. In addition, we impute values of the 
LAST for those who did not take it. 

Our imputations are guided by a growing literature (see for example Cameron and Trivedi, 2005). 
Consistent with this literature, we employ a model-based approach to imputing SAT and LAST scores for 
missing observations. As shown in our results presented below, we have examined several alternative 
models to explore the robustness of our results to the imputation of SAT and LAST scores. 

* We also estimate the model with student achievement level as the dependent variable, the previous year’s 
achievement and its square as independent variables along with all other independent variables and a school 
fixed effect, omitting the student fixed effect, and obtain results that are remarkably similar to those 
presented for student fixed effects. The effect of employing this model in assessing the effect of teacher 
observables on student achievement is presented below; a full set of coefficient estimates is available from 
the authors. 







example, as shown in Table 4, while teacher experience is statistically significant and appears 
important, few of the other measures of teacher qualifications are, even though if entered alone 
they would have been. 

The gains to teacher experience can serve as a benchmark against which to judge the effect 
size of other teacher qualifications. As discussed above, the coefficient estimates for experience 
in Table 4 may provide misleading estimates of the gains that accrue to teacher experience. 
These results are a combination of teacher improvement with experience and teacher attrition. 
Figure 8 shows the gains to experience for math achievement in a model that employs teacher 
fixed effects and thus increments to value added are identified only from teachers who persist 
from one year to the next. As shown, teachers continue to improve the achievement outcomes of 
their students over the first 3 to 5 years of their careers. The effect of moving from being 
completely inexperienced to having a full year of experience is the largest gain and in our sample 
of 4th and 5th grade math achievement is about 0.06 standard deviations. 

Other measures of teacher qualifications also are related to student achievement gains. Not 
being certified at the time a teacher taught the course reduces student achievement by 0.042, 
roughly two-thirds the size of the gain from the first year of teaching experience, which most 
observers agree is important. An effect of similar size results from improving math SAT scores: 
a one standard deviation increase improves student achievement by 0.041. Having a teacher who attended a 
competitive undergraduate college improves performance relative to one who attended a less 
competitive college, but the effect is small (0.014). 
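
The comparisons in this paragraph are simple ratios of Table 4 coefficients, and a short check makes them explicit. The sketch below assumes that the experience category labeled "2" in Table 4 is the first-year gain (it matches the figure of about 0.06 quoted in the text); the variable names are illustrative.

```python
# Coefficients as reported in Table 4 (student fixed-effects model, grades 4-5 math).
FIRST_YEAR_GAIN = 0.06549      # experience category "2"; assumed to be the first-year gain
NOT_CERTIFIED = -0.04235       # teacher not certified when teaching the course
MATH_SAT_PER_POINT = 0.00043   # effect of one additional math SAT point
MATH_SAT_ONE_SD_EFFECT = 0.041  # effect of a one-SD increase, as stated in the text

# Size of the certification effect relative to the first-year experience gain.
cert_vs_experience = abs(NOT_CERTIFIED) / FIRST_YEAR_GAIN

# SAT standard deviation implied by the per-point coefficient and the one-SD effect.
implied_sat_sd = MATH_SAT_ONE_SD_EFFECT / MATH_SAT_PER_POINT

print(f"certification effect / first-year gain: {cert_vs_experience:.2f}")
print(f"implied math SAT standard deviation: {implied_sat_sd:.0f} points")
```

The first ratio is close to two-thirds, as the text states, and the second implies a math SAT standard deviation of a little under 100 points in this sample.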

The Combined Effect of Teacher Characteristics 

Although some of the individual qualifications described above affect student outcomes 
in important ways, often the effects are relatively small in magnitude when compared with the 
variation in student learning over a school year. However, the rather substantial changes in 
teacher qualifications in the poorest schools during the 2000 to 2005 period occurred across a 
variety of measures. The effects of these joint changes are likely to be greater than changes in a 
single measure holding other attributes constant. In order to estimate the combined effect of the 
change, we use the coefficient estimates for the teacher variables presented in Table 4 and the 
actual qualifications of teachers in the poorest and most affluent deciles of schools in 2001 and 
2005 to predict the student achievement gains attributable solely to changes in teacher 
qualifications.^ 
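
The prediction step can be illustrated with a small sketch. It is not the authors' procedure in full: it uses only three of the Table 4 coefficients, the shares of uncertified teachers are hypothetical, and the SAT means are the 2000 and 2005 highest-poverty-decile values from Table 2 rather than the two-year averages used in the paper. Because the model is linear, the mean predicted effect equals the prediction evaluated at mean qualifications.

```python
# Subset of Table 4 coefficients (illustrative; the paper uses all teacher variables).
COEFS = {
    "not_certified": -0.04235,
    "math_sat": 0.00043,
    "verbal_sat": -0.00034,
}


def predicted_effect(mean_qualifications):
    """Predicted achievement contribution of average teacher qualifications."""
    return sum(COEFS[name] * mean_qualifications[name] for name in COEFS)


# Average qualifications in the poorest decile. The not_certified shares are
# HYPOTHETICAL; the SAT means come from Table 2 (2000 and 2005).
means_start = {"not_certified": 0.30, "math_sat": 447, "verbal_sat": 461}
means_end = {"not_certified": 0.05, "math_sat": 471, "verbal_sat": 485}

change = predicted_effect(means_end) - predicted_effect(means_start)
print(f"predicted change in achievement: {change:.4f} standard deviations")
```

Repeating the calculation for the most affluent decile and differencing the two changes gives the predicted change in the gap.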



^ To ensure stability in the predictions, we employ averages of teacher qualifications in 2000 and 2001 
(labeled 2001) and 2004 and 2005 (labeled 2005). 







As shown in Figure 9, the improvement in qualifications increased predicted student 
achievement in the poorest decile, shifting the overall distribution to the right between 2001 and 
2005. On average the change in qualifications of teachers increased student achievement by 
0.029 standard deviations in the decile of schools with the highest concentration of students in 
poverty. The predicted student gains in the most affluent decile of schools improved by 0.007. 
Therefore, as a result only of the change in observed teacher qualifications, the gap between the 
poorest and richest deciles declined by 0.022, from .089 to .067 (see Table 5). Said differently, 
improvements in the measured teacher qualifications in the poorest decile of schools reduced the 
gap resulting from observed differences by 25 percent. 

The reduction in the achievement gap resulting from improved teacher qualifications is 
robust to several alternative specifications. As shown in Table 5, if instead of imputing the SAT 
and LAST exams, we drop the math and verbal SAT variables and omit observations that are 
missing the LAST, the poorest decile shows greater improvement and the gap closes by more 
than in our base model. If instead, we include the SAT variables and omit observations with 
missing values, gains to the poorest decile are much greater as is the gap closing. Finally we 
estimate a model similar to the Base model that employs current achievement levels as the 
dependent variable with lagged student achievement and school fixed effects instead of a gain 
model with student fixed effects. In these estimates the gap closes by 0.029. 

One way of summarizing these results is to examine what portion of the original gap 
between the most affluent and poorest deciles each model would predict is eliminated as a result 
of improved teacher qualifications. As shown in the last row of Table 5, across four quite 
different specifications the percentage of gap reduction attributable solely to observed teacher 
qualification varies between 20 and 28 with the base model predicting 25 percent. These 
predicted effects include the effect of the reduction in the teacher experience gap. If that effect 
were held constant, there would still be a narrowing of the gap in student achievement gains of 
.018 in the Base model, as shown in the last column of Table 5. Thus, about 80 percent of the 
reduction in the original gap between schools with poor and more affluent students is attributable 
to qualifications other than experience. 
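
The percentages in the last row of Table 5 follow directly from the 2001 and 2005 gaps. The sketch below recomputes them from the rounded values printed in the table, so small differences from the reported figures are expected; the dictionary keys and function name are illustrative.

```python
# (gap in 2001, gap in 2005) between most affluent and poorest deciles, from Table 5.
GAP = {
    "Base, all obs": (0.089, 0.067),
    "Base, exp < 3": (0.095, 0.054),
    "Drop SAT variables": (0.146, 0.117),
    "Drop missing SAT obs": (0.212, 0.152),
    "School FE": (0.121, 0.091),
    "No experience": (0.082, 0.064),
}

# Percentage reduction in gap as reported in the last row of Table 5.
REPORTED = {
    "Base, all obs": 24.8,
    "Base, exp < 3": 43.0,
    "Drop SAT variables": 19.7,
    "Drop missing SAT obs": 28.4,
    "School FE": 24.3,
    "No experience": 21.9,
}


def pct_reduction(gap_2001, gap_2005):
    """Share of the 2001 gap eliminated by 2005, in percent."""
    return 100.0 * (gap_2001 - gap_2005) / gap_2001


for name, (g01, g05) in GAP.items():
    print(f"{name:22s} computed {pct_reduction(g01, g05):5.1f}  reported {REPORTED[name]:5.1f}")

# Share of the base-model gap reduction that remains when experience is held constant.
non_experience_share = 0.018 / 0.022
print(f"share not due to experience: {non_experience_share:.0%}")
```

The recomputed values agree with the reported ones to within rounding, and the final ratio reproduces the statement that about 80 percent of the reduction is attributable to qualifications other than experience.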

As noted above, the change in teacher sorting has been driven almost exclusively by new 
teachers. Many teachers in a school remain unchanged over any five year period and thus when 
examining the effect of changes in teacher qualifications, these observations do not contribute to 
improved student achievement (except for the net gains to experience). The prior analyses 
predict student achievement based on the full sample of teachers. The results are predictably 
stronger if we look only at teachers in their first or second year of teaching. As shown in the 







second column of Table 5, achievement predicted only from the observable qualifications of first- 
and second-year teachers in the poorest decile of schools improves by 0.044 from 2001 to 2005, 
about two-thirds of the gain estimated to accrue to teachers after their first year of teaching. The 
gap in student achievement between poor and more affluent schools was reduced by 0.041. Thus 
the changes in teacher qualifications alone that occurred in New York City’s poorest schools 
between 2000 and 2005 had a meaningful effect on 4th and 5th grade math achievement. 

In addition to explaining a moderate proportion of the change in achievement across 
schools, the results show that there is a substantial difference between the teachers in predicted 
student achievement gains based solely on observable qualifications. As is apparent in any of the 
achievement distributions in Figure 9, there are meaningful achievement differences between 
higher and lower performing teachers solely attributable to observed teacher qualifications. 
Consider only 4th and 5th grade teachers whose students are in the quartile of schools with the 
highest rates of student poverty. The difference between the average value added attributable 
solely to teacher qualifications for those teachers in the top and bottom quintiles of this 
distribution is 0.16, roughly three times the gain attributable to the first year of 
teacher experience. Table 6 shows how these values change over the quintiles of value added for 
teachers in the poorest quartile of schools. It also shows the average qualifications of teachers in 
each of these quintiles. There are important differences in qualifications between teachers who 
produce the highest and lowest value added, even among teachers working in the poorest 
quartile of schools. Those with the weakest value added tend to be inexperienced, to have failed the 
LAST certification exam the first time they took it, to be uncertified at the time they teach the class, 
and to have low math SAT scores. 
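
The comparison between quintiles is a one-line calculation from Table 6. The sketch below uses the mean value added reported for the bottom and top quintiles and the first-year experience gain of about 0.06 quoted in the text; the variable names are illustrative.

```python
# Mean predicted value added by quintile, poorest quartile of schools (Table 6).
BOTTOM_QUINTILE_VA = -0.103
TOP_QUINTILE_VA = 0.059

# First-year experience gain as quoted in the text (about 0.06 standard deviations).
FIRST_YEAR_GAIN = 0.06

quintile_gap = TOP_QUINTILE_VA - BOTTOM_QUINTILE_VA
ratio_to_experience = quintile_gap / FIRST_YEAR_GAIN

print(f"top minus bottom quintile: {quintile_gap:.3f} standard deviations")
print(f"relative to first-year gain: {ratio_to_experience:.1f}x")
```

The ratio comes out somewhat below three, which is the basis for the "roughly three times" description.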

The conclusion arising from this analysis is clear. The performance of students in 4th and 
5th grade math can be substantially increased across all stratifications of students by recruiting 
and hiring better qualified teachers. 

The effects of observed teacher qualifications on student achievement are more modest 
for middle school math. Figure 10 shows how the narrowing of differences in teacher 
qualifications from 2001 to 2005 corresponds to improvement in student achievement of 0.015 
for the poorest decile, but to virtually no change in the gap between the poorest and the most 
affluent deciles. If limited to only teachers in their first or second year of teaching the poorest 
decile improves by 0.020 standard deviations. The smaller effects for middle school achievement 

*** Of course, unobserved measures of teacher qualifications, such as motivation, that are positively 
correlated with the observed measures, may also be contributing to these effects. However, from a 
recruitment perspective if these unobserved measures are consistently correlated with the observed 
measures, the effect for improved student achievement is not altered. 







are fully consistent with the smaller changes in teacher qualifications noted above and in 
Appendix Table 3b. Nonetheless, there are meaningful within decile differences in the predicted 
effects of observed teacher qualifications of the least and most effective teachers, and thus, again, 
recruiting more qualified teachers could meaningfully improve achievement outcomes. 

VI. Conclusions 

The gap between the qualifications of New York City teachers in high-poverty schools 
and low-poverty schools has narrowed substantially since 2000. For example, in 2000 teachers in 
the highest-poverty decile of schools had verbal SAT scores that on average were 59 points lower 
than their counterparts in the lowest poverty decile of schools. By 2005 this gap had narrowed to 
39 points. The same general pattern held for other teacher qualifications such as the failure-rate 
on the Liberal Arts and Sciences (LAST) teacher certification exam, the percentage of teachers 
who attended undergraduate college at “least competitive” institutions, and the percentage of 
teachers in a school who are novices with less than three years of New York City teaching 
experience. Most of this gap-narrowing resulted from changes in the characteristics of newly 
hired teachers, rather than from differences in quit rates and transfer rates between high- and low- 
poverty schools. 

The gap-narrowing associated with new hires has been largely driven by the virtual 
elimination of newly hired uncertified teachers coupled with an influx of teachers with strong 
academic backgrounds in the Teaching Fellows program and Teach for America (TFA). Only five 
percent of newly hired Teaching Fellows and TFA teachers in 2003 had failed the LAST exam on 
their first attempt, while 16.2 percent of newly hired traditional teachers had failed the LAST 
exam, and fully 32.5 percent of uncertified teachers had failed the LAST exam. In 2005, 43 
percent of all new teachers in the quartile of schools with the poorest students were Teaching 
Fellows or TFA teachers. 

The improvements in teacher qualifications, especially among the poorest schools, appear 
to have resulted in improved student achievement. By estimating the effect of teacher attributes 
using a value-added model, the analyses above predict that observable qualifications of teachers 
resulted in average improved achievement for students in the poorest decile of schools of .03 
standard deviations, about half the difference between being taught by a first year teacher and a 
more experienced teacher. If limited to teachers who are in the first or second year of teaching, 
where changes in qualifications are greatest, the gain equals two-thirds of the first-year 
experience effect. 







These changes resulted from policy interventions that changed the qualifications of the 
teachers of poor, minority and low achieving students in New York City. In particular, these 
changes can be attributed to the New York State policy that eliminated uncertified teachers and 
the New York City policy that established the Teaching Fellows program and, to a lesser extent, 
employed Teach for America teachers. The sorting of the least qualified teachers to the students 
most in need of better teachers is not destiny, but it requires forceful action by policy makers and 
a commitment by local hiring authorities to attract more highly qualified teachers. 

Perhaps most intriguing, much larger gains could result if teachers with strong teacher 
qualifications could be recruited. Among teachers teaching 4th and 5th grade math students in 
schools with the highest proportions of students in poverty, we found there are substantial 
differences in student achievement solely attributable to differences in observed teacher 
qualifications. The top quintile has value added that differs from the bottom quintile by three 
times the effect accruing to the first year of experience. Thus, recruitment can substantially 
change outcomes for students. 

Producing better student achievement likely results from several complementary 
strategies. Clearly a large proportion of the variation in teacher effectiveness in improving 
student achievement is not related to measurable teacher characteristics such as test scores or 
certification. Because of this, policies that enable school leaders to better understand the 
strengths and weaknesses of each teacher so that they can target professional development and 
effectively utilize the due-process system to continually improve the teacher workforce are likely 
to be important. However, this paper suggests that selection of teachers with stronger 
qualifications has made an important difference in New York City public schools and that 
recruitment and retention of teachers with stronger measurable characteristics can lead to 
improved student learning. 







Figure 1: Percentage of New York City Teachers With Less than 3 Years of 
Experience, By Poverty Grouping of School’s Students, 2000 
[Vertical axis: % of teachers. Horizontal axis: Poverty: % of students eligible for free lunch.] 

Figure 2: Percent of All New York City Teachers Who Failed the LAST Exam on First 
Taking by Poverty Quartile of School’s Students, 2000-2005 

Figure 3: Percent of All New York City Teachers Who are Novices 
by Poverty Quartile of School’s Students, 2000-2005 
[Series: Lowest quartile, 2nd quartile, 3rd quartile, Highest quartile.] 

Figure 4: Change in Proportion of Teachers Failing LAST Exam on First Taking 
Between 2000 and 2005 by Poverty Decile of School’s Students 

Figure 5: Average LAST First Time Failure Rate of Those Teaching in 2000 and the Effect 
of Their Transfer and Quits Over Time 

Figure 6: LAST Exam Failure Rate of New Teachers by Poverty Quartile 
of School’s Students, 2000-2005 
[Series: Lowest quartile, 2nd quartile, 3rd quartile, Highest quartile.] 

Figure 8: Improvements in Math Student Achievement Attributable to Additional Teacher 
Experience 

Figure 9: Effect of Observed Teacher Qualifications on Students in Grades 4 & 5 Math 
Achievement, Most Affluent and Poorest Deciles of Schools, 2001 and 2005 
[Series: Rich 2001, Poor 2001, Rich 2005, Poor 2005.] 

Figure 10: Effect of Observed Teacher Qualifications on Students in Grades 6 - 8 Math 
Achievement, Most Affluent and Poorest Deciles of Schools, 2001 and 2005 
[Series: Rich 2001, Poor 2001, Rich 2005, Poor 2005.] 



Table 1 

Qualifications of Teachers by Poverty Status of Schools in Which They Taught in 2000 

Teacher Attribute                      Lowest    >10th to    2nd       3rd       >75th to    Highest   Highest 10% 
                                       10%       25th        quartile  quartile  90th        10%       minus 
                                                 percentile                      percentile            Lowest 10% 

% with less than 3 years of NYC 
  teaching experience                  14.7%     18.6%       20.8%     22.9%     25.1%       25.4%     10.7% 
% who failed LAST exam on first 
  attempt                              12.2%     16.8%       23.5%     29.6%     35.3%       34.2%     22.0% 
% who attended least competitive 
  undergraduate institutions           23.5%     22.9%       23.5%     25.3%     27.5%       27.4%     3.9% 
SAT verbal score                       506       487         481       472       465         461       -45 
SAT math score                         490       477         468       461       451         447       -43 
Average expenditures per pupil*        8,002     8,335       8,338     8,738     9,093       9,479     1,520 
% Eligible for Free Lunch              21.6%     50.4%       67.6%     81.6%     90.5%       96.3%     74.7% 

* All 2000 dollars adjusted to 2005 school year dollars using the CPI. 





Table 2: Average School Qualifications of Teachers by Student Poverty, 2000 and 2005 

                                   2000                            2005                            Change from 2000 to 2005 
Teacher Attribute                  Lowest    Highest   Gap         Lowest    Highest   Gap         Lowest    Highest   Change 
                                   10%       10%                   10%       10%                   10%       10%       in Gap 

% with less than 3 years of 
  NYC teaching experience          14.7%     25.4%     10.7%       15.1%     21.7%     6.6%        0.4%      -3.7%     -4.1% 
% who failed LAST exam on 
  first attempt                    12.2%     34.2%     22.0%       13.4%     24.7%     11.3%       1.2%      -9.5%     -10.7% 
% who attended least competitive 
  undergraduate institutions       23.5%     27.4%     3.9%        26.7%     24.3%     -2.4%       3.2%      -3.1%     -6.3% 
SAT verbal score                   506       461       -45         503       485       -18         -3        23        -26 
SAT math score                     490       447       -43         495       471       -23         5         24        -19 
Number of absences                 na        na                    10.0      10.8      0.7         na        na        na 
Expenditures per pupil*            $8,002    $9,479    $1,477      $9,711    $11,866   $2,155      $1,709    $2,387    $677 
Teacher salary                     na        na                    $59,314   $53,830   -$5,484     na        na        na 

Gap: Highest 10% - Lowest 10%. 
* All 2000 dollars adjusted to 2005 school-year dollars using the CPI. 




Table 3: Percentage of New York City Students Failing to Meet Proficiency on Achievement Exams 
by Test and Poverty Decile, 2000 and 2005 

                         2000                            2005                            Change from 2000 to 2005 
                         Lowest    Highest   Gap         Lowest    Highest   Gap         Lowest    Highest   Change 
                         10%       10%                   10%       10%                   10%       10%       in gap 

Percent failing to meet proficiency 
  ELA grade 4            29.6      73.7      44.2        18.1      50.5      32.4        -11.5     -23.2     -11.8 
  Math grade 4           24.3      71.1      46.8        7.7       29.2      21.5        -16.6     -41.8     -25.2 
  ELA grade 8            37.5      78.4      40.9        41.3      76.2      35.0        3.7       -2.2      -5.9 
  Math grade 8           51.9      85.6      33.7        38.9      69.4      30.5        -13.1     -16.2     -3.2 

Percent achieving highest level 
  ELA grade 4            25.6      2.8       -22.8       32.0      8.1       -23.9       6.4       5.4       -1.1 
  Math grade 4           26.8      2.5       -24.3       56.0      19.3      -36.7       29.2      16.8      -12.4 
  ELA grade 8            18.9      3.1       -15.8       13.7      1.6       -12.1       -5.2      -1.5      3.8 
  Math grade 8           10.1      1.2       -8.9        15.0      2.4       -12.6       4.9       1.2       -3.7 

Mean test scores 
  ELA grade 4            665.3     620.3     -44.9       679.8     643.4     -36.4       14.5      23.1      8.6 
  Math grade 4           657.7     617.3     -40.4       684.6     651.4     -33.2       26.9      34.1      7.3 
  ELA grade 8            710.3     668.3     -42.1       706.0     681.2     -24.8       -4.4      12.9      17.3 
  Math grade 8           711.8     676.2     -35.6       725.0     698.3     -26.7       13.2      22.1      8.9 

Gap: Highest 10% - Lowest 10%. 




Table 4: Base Model for Math Grades 4 & 5 with Student Fixed Effects, 2000-2005 

Variable                                        Coefficient 

Constant                                        0.17147     [1.51] 
Student changed schools                         -0.03712    [6.60]** 

Class Variables 
  Proportion Hispanic                           -0.4576     [12.89]** 
  Proportion Black                              -0.57974    [16.16]** 
  Proportion Asian                              -0.07711    [1.75] 
  Proportion other                              -0.56887    [3.95]** 
  Class size                                    0.002       [3.36]** 
  Proportion Eng Lang Learn                     -0.42941    [14.16]** 
  Proportion home lang Eng                      -0.02902    [1.16] 
  Proportion free lunch                         -0.00181    [0.01] 
  Proportion reduced lunch                      0.10521     [3.40]** 
  Mean absences t-1                             -0.01367    [15.10]** 
  Mean suspensions t-1                          0.14069     [2.78]** 
  Mean ELA score t-1                            0.33811     [31.29]** 
  Mean math score t-1                           -0.88479    [58.78]** 
  SD ELA score t-1                              -0.02332    [1.91] 
  SD math score t-1                             -0.11722    [8.27]** 

Teacher Variables 
 Experience 
  2                                             0.06549     [10.61]** 
  3                                             0.1105      [16.56]** 
  4                                             0.13408     [17.91]** 
  5                                             0.117       [14.24]** 
  6                                             0.13365     [14.58]** 
  7                                             0.12307     [12.27]** 
  8                                             0.11898     [10.81]** 
  9                                             0.12433     [10.04]** 
  10                                            0.13693     [9.85]** 
  11                                            0.12592     [9.41]** 
  12                                            0.10209     [7.66]** 
  13                                            0.11831     [8.23]** 
  14                                            0.1263      [8.21]** 
  15                                            0.1252      [6.82]** 
  16                                            0.12464     [6.36]** 
  17                                            0.08298     [3.10]** 
  18                                            0.14161     [4.02]** 
  19                                            0.13686     [2.62]** 
  20                                            0.24658     [2.50]* 
  21 or more                                    0.38977     [3.89]** 
 Cert pass first                                0.00657     [0.94] 
 Imputed LAST score                             0.00025     [0.57] 
 LAST missing                                   0.00188     [0.26] 
 Certified Math                                 0.07086     [1.30] 
 Certified Science                              -0.04852    [0.95] 
 Certified special ed                           0.01086     [1.05] 
 Certified other                                -0.00521    [0.62] 
 Not certified                                  -0.04235    [5.72]** 
 Barron’s undergrad college 
  Most competitive                              0.01498     [1.48] 
  Competitive                                   0.01426     [2.24]* 
  Least Competitive                             0.00686     [1.25] 
 Imputed Math SAT                               0.00043     [9.05]** 
 Imputed Verbal SAT                             -0.00034    [6.06]** 
 SAT missing                                    -0.01535    [2.94]** 
 Initial path into teaching 
  Individual evaluation                         -0.02243    [2.81]** 
  NYC Teaching Fellows                          -0.01935    [1.89] 
  Teach for America                             -0.00744    [0.37] 
  Temporary License                             -0.03109    [4.95]** 
  Other                                         -0.03246    [2.31]* 
 Teacher LAST * class proportion free lunch     -0.00024    [0.49] 

Observations                                    578,630 




Table 5: Effect of Observed Teacher Qualifications on Student Grades 4 & 5 Math Achievement, Most Affluent and Poorest
Deciles of Schools, 2001 and 2005 for Various Model Specifications*

                                Imputed SAT and LAST    Drop SAT    Drop Missing   School      No
                                All Obs     Exp < 3     Variables   SAT Obs        FE          Experience
Most affluent decile
  2001                          0.049       -0.011      0.093       0.129          0.074       0.050
  2005                          0.056       -0.008      0.102       0.125          0.077       0.048
  Change                        0.007       0.003       0.009       -0.004         0.004       -0.002
Poorest decile
  2001                          -0.040      -0.106      -0.053      -0.083         -0.047      -0.032
  2005                          -0.011      -0.062      -0.015      -0.027         -0.014      -0.016
  Change                        0.029       0.044       0.038       0.056          0.033       0.016
Gap between most affluent and poorest decile
  2001                          0.089       0.095       0.146       0.212          0.121       0.082
  2005                          0.067       0.054       0.117       0.152          0.091       0.064
  Change                        -0.022      -0.041      -0.029      -0.060         -0.029      -0.018
Percentage reduction in gap     24.8        43.0        19.7        28.4           24.3        21.9

* Base model is as shown in Table 4; Exp < 3 includes only teachers in their first two years of teaching; Drop SAT Variables omits the SAT variables
from the estimation; Drop Missing SAT Obs omits any teacher for whom we do not observe SAT scores, which has the effect of eliminating about 45
percent of the observations; School FE substitutes school fixed effects for student fixed effects in the Base Model; No Experience is the base
model with teacher experience omitted from the predictions.
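
The lower panels of Table 5 follow arithmetically from the two decile panels. The sketch below illustrates that arithmetic under two assumptions: the gap is the most affluent decile's value minus the poorest decile's, and the percentage reduction is the fall in the gap relative to its 2001 level. The function and variable names are illustrative only. Because the inputs are the rounded values printed in the table, the computed percentage can differ from the published figure in the last digit.

```python
def gap_summary(affluent_2001, affluent_2005, poorest_2001, poorest_2005):
    """Return (gap_2001, gap_2005, change, pct_reduction) for one model column.

    Assumes gap = most affluent decile minus poorest decile, and
    pct_reduction = fall in the gap as a percentage of the 2001 gap.
    """
    gap_2001 = affluent_2001 - poorest_2001
    gap_2005 = affluent_2005 - poorest_2005
    change = gap_2005 - gap_2001
    pct_reduction = -100.0 * change / gap_2001
    return gap_2001, gap_2005, change, pct_reduction


# Base model column ("All Obs"), using the rounded values printed in Table 5.
g01, g05, chg, pct = gap_summary(0.049, 0.056, -0.040, -0.011)
print(f"gap 2001 = {g01:.3f}, gap 2005 = {g05:.3f}, "
      f"change = {chg:.3f}, reduction = {pct:.1f}%")
```

The same function can be applied to any other column of the table by substituting that column's four decile values.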







Table 6: Average Qualifications of Teachers in Poorest Quartile of Schools by Math Achievement Quintiles
Predicted Solely from Teacher Qualifications, 2000-2005

                                                                          Barron's Ranking of Undergraduate College
VA        Mean     Years       LAST Pass  Not        LAST   Math  Verbal  Most                      Less         Not
Quintile  VA       Experience  First      Certified  Score  SAT   SAT     Competitive  Competitive  Competitive  Competitive
1         -0.103   2.054       0.653      0.626      238    423   478     0.135        0.136        0.442        0.28
2         -0.033   5.324       0.638      0.272      242    421   466     0.102        0.096        0.493        0.301
3         -0.003   6.867       0.715      0.063      244    433   469     0.078        0.095        0.516        0.31
4         0.021    6.546       0.777      0.022      247    446   461     0.105        0.153        0.415        0.32
5         0.059    5.944       0.872      0.007      252    489   459     0.162        0.229        0.389        0.21
Range     0.162    3.890       0.219      -0.619     14     66    -18     0.027        0.093        -0.052       -0.06
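
The Range row of Table 6 appears to be the quintile 5 value minus the quintile 1 value for each column. A minimal sketch of that check, assuming this definition and using a handful of the rounded values printed in the table; the dictionary keys are hypothetical labels chosen for illustration.

```python
# Quintile 1 and quintile 5 rows of Table 6 (rounded values as printed).
# The keys are illustrative labels, not names used in the paper.
quintile_1 = {
    "mean_va": -0.103,
    "years_experience": 2.054,
    "last_pass_first": 0.653,
    "not_certified": 0.626,
    "math_sat": 423,
}
quintile_5 = {
    "mean_va": 0.059,
    "years_experience": 5.944,
    "last_pass_first": 0.872,
    "not_certified": 0.007,
    "math_sat": 489,
}


def quintile_range(top, bottom):
    """Difference between the top and bottom quintile for each measure."""
    return {key: round(top[key] - bottom[key], 3) for key in top}


ranges = quintile_range(quintile_5, quintile_1)
for name, value in ranges.items():
    print(f"{name}: {value}")
```

Small discrepancies against the printed Range row are possible for other columns, since the published differences were presumably computed from unrounded values.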






Appendix Table 1
Average School Qualifications of Teachers by Percent of Students in School Who are Black or Hispanic, 2000 and 2005

                              2000                                  2005                                  Change from 2000 to 2005
                              Lowest    Highest   Gap: Highest      Lowest    Highest   Gap: Highest      Lowest    Highest   Change
                              10%       10%       10% - Lowest 10%  10%       10%       10% - Lowest 10%  10%       10%       in Gap
% with less than 3 years of   14.4%     26.3%     11.9%             14.7%     19.8%     5.1%              0.3%      -6.5%     -6.8%
NYC teaching experience
% who failed LAST exam on     13.9%     37.0%     23.1%             15.0%     28.3%     13.3%             1.1%      -8.7%     -9.8%
first attempt
% who attended least          28.4%     30.1%     1.7%              30.1%     29.4%     -0.7%             1.7%      -0.7%     -2.4%
competitive undergraduate
institutions
SAT verbal score              490       458       -33               493       472       -20               2         15        -12
SAT math score                480       440       -40               487       457       -30               6         17        -10
Number of absences            na        na                          10.6      10.6      0.0               na        na        na
Expenditures per pupil*       $8,140    $8,923    $783              $10,940   $11,675   $735              $2,800    $2,752    -$49
Teacher salary                na        na                          $59,472   $54,019   -$5,453           na        na        na

* All 2000 dollars adjusted to 2005 school-year dollars using the CPI.







Appendix Table 2
Average School Qualifications of Teachers by Percent of Students in School Who Scored at Level 1 on 4th Grade ELA Exam, 2000
and 2005

                              2000                                  2005                                  Change from 2000 to 2005
                              Lowest    Highest   Gap: Highest      Lowest    Highest   Gap: Highest      Lowest    Highest   Change
                              10%       10%       10% - Lowest 10%  10%       10%       10% - Lowest 10%  10%       10%       in Gap
% with less than 3 years of   18.3%     31.5%     13.2%             16.4%     18.0%     1.6%              -1.9%     -13.5%    -11.6%
NYC teaching experience
% who failed LAST exam on     13.6%     39.1%     25.5%             15.4%     25.7%     10.3%             1.8%      -13.4%    -15.2%
first attempt
% who attended least          22.8%     30.9%     8.1%              22.3%     28.3%     6.0%              -0.5%     -2.6%     -2.1%
competitive undergraduate
institutions
SAT verbal score              490       458       -32               494       475       -19               4         16        -12
SAT math score                475       440       -35               486       458       -28               11        18        -7
Number of absences            na        na                          10.6      10.9      0.2               na        na        na
Expenditures per pupil*       $8,135    $10,124   $1,989            $10,197   $14,410   $4,214            $2,062    $4,287    $2,225
Teacher salary                na        na                          $57,941   $54,566   -$3,375           na        na        na

* All 2000 dollars adjusted to 2005 school-year dollars using the CPI.







Appendix Table 3a
Average School Qualifications of Teachers in Elementary Schools by Student Poverty, 2000 and 2005

                              2000                                  2005                                  Change from 2000 to 2005
                              Lowest    Highest   Gap: Highest      Lowest    Highest   Gap: Highest      Lowest    Highest   Change
                              10%       10%       10% - Lowest 10%  10%       10%       10% - Lowest 10%  10%       10%       in Gap
Elementary
% with less than 3 years of   15.7%     27.6%     11.9%             14.4%     20.0%     5.6%              -1.3%     -7.6%     -6.3%
NYC teaching experience
% who failed LAST exam on     10.4%     37.7%     27.3%             12.3%     26.7%     14.4%             1.9%      -11.0%    -12.9%
first attempt
% who attended least          24.7%     29.2%     4.5%              27.2%     26.3%     -0.9%             2.5%      -2.9%     -5.4%
competitive undergraduate
institutions
SAT verbal score              502       452       -50               496       474       -22               -6        22        -28
SAT math score                482       435       -47               486       459       -27               4         24        -20







Appendix Table 3b
Average School Qualifications of Teachers in Middle Schools by Student Poverty, 2000 and 2005

                              2000                                  2005                                  Change from 2000 to 2005
                              Lowest    Highest   Gap: Highest      Lowest    Highest   Gap: Highest      Lowest    Highest   Change
                              10%       10%       10% - Lowest 10%  10%       10%       10% - Lowest 10%  10%       10%       in Gap
Middle School
% with less than 3 years of   16.7%     28.0%     11.3%             15.2%     26.4%     11.2%             -1.5%     -1.6%     -0.1%
NYC teaching experience
% who failed LAST exam on     15.7%     32.0%     16.3%             14.6%     27.2%     12.6%             -1.1%     -4.8%     -3.7%
first attempt
% who attended least          24.2%     27.7%     3.5%              30.7%     24.8%     -5.9%             6.5%      -2.9%     -9.4%
competitive undergraduate
institutions
SAT verbal score              501       473       -28               497.8     489.3     -9                -3        16        -19
SAT math score                517       483       -34               492.8     475.0     -18               -24       -8        -16







Appendix Table 3c
Average School Qualifications of Teachers in High Schools by Student Poverty, 2000 and 2005

                              2000                                  2005                                  Change from 2000 to 2005
                              Lowest    Highest   Gap: Highest      Lowest    Highest   Gap: Highest      Lowest    Highest   Change
                              10%       10%       10% - Lowest 10%  10%       10%       10% - Lowest 10%  10%       10%       in Gap
High School
% with less than 3 years of   10.7%     18.2%     7.5%              16.6%     22.9%     6.3%              5.9%      4.7%      -1.2%
NYC teaching experience
% who failed LAST exam on     15.7%     32.0%     16.3%             15.5%     17.7%     2.2%              -0.2%     -14.3%    -14.1%
first attempt
% who attended least          19.8%     22.0%     2.2%              22.5%     18.6%     -3.9%             2.7%      -3.4%     -6.1%
competitive undergraduate
institutions
SAT verbal score              522.0     485.9     -36               526.7     513.8     -13               5         28        -23
SAT math score                516.5     482.8     -34               520.4     504.8     -16               4         22        -18







References 



Aaronson, D., L. Barrow, and W. Sander (2003) “Teachers and Student Achievement in the Chicago
Public Schools” Federal Reserve Bank of Chicago WP-2002-28.

Betts, J., K. Rueben, and K. Danenberg (2000) Equal Resources, Equal Outcomes? The Distribution
of School Resources and Student Achievement in California, Public Policy Institute of
California.

Bonesrønning, Falch, and Strøm (2005) “Teacher sorting, teacher quality, and student
composition” European Economic Review, Vol. 49, 457-83.

Boyd, D., P. Grossman, H. Lankford, S. Loeb & J. Wyckoff (2006) “How Do Various Attributes
of Teachers Affect Students’ Test-Score Gains?” working paper.

Boyd, D., P. Grossman, H. Lankford, S. Loeb & J. Wyckoff (2007) “Who Leaves? Teacher
Attrition and Student Achievement” working paper.

Boyd, Donald, Hamilton Lankford, Susanna Loeb and James Wyckoff, “Initial Matches,
Transfers, and Quits: Career Decisions and the Disparities in Average Teacher Qualifications
Across Schools” Working Paper.

Boyd, Donald, Hamilton Lankford and James Wyckoff “Closing the Student Achievement Gap
by Increasing the Effectiveness of Teachers in Low-Performing Schools” forthcoming, H.
Ladd and E. Fiske eds. Handbook of Research in Education Finance and Policy.

Clotfelter, C., H. Ladd and J. Vigdor (2006a) “Teacher-Student Matching and the Assessment of Teacher
Effectiveness” National Bureau of Economic Research Working Paper 11936.

Clotfelter, C., H. Ladd and J. Vigdor (2006b) “How and Why do Teacher Credentials Matter for
Student Achievement” Working paper, Duke University.

Coleman, James S. (1966) Equality of Education Opportunity Study, Washington, DC: U.S.
Department of Health, Education, and Welfare, Office of Education/National Center for
Education Statistics.

Goldhaber, D. (2006) Everyone’s Doing It, but What Does Teacher Testing Tell Us About
Teacher Effectiveness? Manuscript.

Kane, Thomas J., Jonah E. Rockoff and Douglas O. Staiger (2006) “What Does Certification Tell
Us About Teacher Effectiveness? Evidence from New York City” NBER Working Paper
12155, April 2006.

Lankford, Hamilton, Susanna Loeb, and James Wyckoff, “Teacher Sorting and the Plight of
Urban Schools: A Descriptive Analysis”, Educational Evaluation and Policy Analysis, Spring
2002, Vol. 24, No. 1, pp. 37-62.

Loeb, S., and L. Miller (2007). A Review of State Teacher Policies: What Are They, What Are
Their Effects, and What Are Their Implications for School Finance. Technical Report,
Getting Down to Facts Project: Stanford University.







Peske, Heather and Kati Haycock, Teaching Inequality: How Poor and Minority Students are
Shortchanged on Teacher Quality, The Education Trust, June 2006.

Rivkin, S., E. Hanushek, and J. Kain (2005) “Teachers, Schools, and Academic Achievement”
Econometrica, 73(2), 417-458.

Rockoff, Jonah (2004). “The Impact of Individual Teachers on Student Achievement: Evidence
from Panel Data,” American Economic Review 94 (2): 247-252.

Sanders, W.L. and Rivers, J.C. (1996). “Research Project Report: Cumulative and Residual
Effects of Teachers on Future Student Academic Achievement,” University of Tennessee
Value-Added Research and Assessment Center.












